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physically handicapped and of visually impaired examinees is most 
similar to the nondisabled population. The performance of learning 
disabled individuals ranks third among the specific disability 
groups, and is appreciably lower than national norms. Hearing 
impaired students perform the least well of all disabled groups on 
admissions tests. The limited data on the reliability of cognitive 
tests for handicapped examinees show no trustworthy difiwrences in 
measurement precision between disabled and nondisabled populations. 
The results of the small number of validity investigations that have 
been conducted also show no dependable differences across 
popuulations . Data on the ACT Assessment and the SAT generally 
support the validity of these measures as equivalent predictors of 
college perf ormiince . (Author/BW) 
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Most admissions testing programs have long made accotmodatlona 
for handicapped examinees, though practices have varied across pro- 
grams and limited research has been undertaken to evaluate such test 
modifications. Regulations under Section 504 of the Rehabilitation 
Act of 1973 Impose new requirements on institutional users, and 
indirectly on admissions test sponsors and developers, in order to 
protect the rights of handicapped persons. The Regulations have not 
been strictly enforced since many have argued that they conflict with 
present technical capabilities of test developers. In 1982, a Panel 
appointed by the National Research Council released a detailed report 
and recommendations calling for research on the validity and compar- 
ability of scores for handicapped persons. 

Due to a shared concern for these Issues, College Board, 
Educational Testing Service, and Graduate Record Examinations Board 
initiated a series of studies in June 1983. The primary objectives 
are: 

To develop an improved base of information concerning the 
testing of handicapped populations. 

To evaluate and improve wherever possible the accuracy of 
assessment for handicapped persons, especially test 
scaling and predictive validity. 

To evaluate and enhance wherever possible the fairness 
and comparability of tests for handicapped and 
nonhandicapped examinees. 

This is one of a series of reports on the project, which will 
continue through 1986. 
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Abstract 

This paper Is the second In a series of reports emanating from a 
four-year research program meant to further knowledge of college 
and graduate admissions testing and handicapped people. The 
purpose of this paper Is to document existing research on the 
test performance of handicapped people with respect to admissions 
and other similar tests. In addition, the psychometric 
characteristics of these tests when used with handicapped people 
are reported. 

Though much more data need to be collected, several 
observations are made. Handicapped students as a general group 
perform appreciably lower than national norms on the SAT and ACT 
Assessment (by about .5 standard deviations). Of the four 
specific disability groups discussed, the admissions test 
performance of physically handicapped and of visually Impaired 
examinees Is most similar to the nondlsabled population. The 
admissions test performance of learning disabled Individuals 
ranks third among the specific disability groups. With limited 
exceptions, the performance of this group Is appreciably lower 
than national norms (genera 1 ly by at least .5 standard 
deviations). Hearing Impaired students perform the least well of 
all disabled groups on admissions tests. 

The limited data on the reliability of cognitive tests for 
handicapped examinees show no trustworthy differences In 
measurement precision between disabled and nondlsabled 
populations. The results of the small number of validity 
Investigations that have been conducted also show no dependable 
differences across populations. Data on the ACT Assessment and 
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the SAT generally support the validity of these measures as 
equivalent predictors of college performance In selected disabled 
student groups and the nonhandicapped population. 

The lack of data on test performance, reliability, and 
validity prohibits drawing definitive conclusions at this time. 
Further research Is necessary before the fairness of the 
admissions testing process for handicapped examinees can be 
soundly evaluated* 
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The College lo^rd, Educational Testing Service <ETS), and 
the Graduate Record Examinations (GRE) Board are collaboratively 

• * 

sponsoring a four^year research program on admissions testing and 
handicapped people* The psogram Is meant to (a) respond to 
Issues raised by/-Sectlon 504 of the Rehabilitation Act of 1973' 
and by the National Research Councll^s Report of the Panel on 
Testing of Handicapped People (Sherman & Robinson, 1982), and (b) 
en'sure that the testing experience and scores oii tests 
administered by ETS to handicapped Individuals are as comparable 
as possible to those of nonhandlcapped examinees. In line with 
this dual purpose, three research and development objectives were 
set* These were to (a) develop an Improved base of Information 
and test data concerning the characteristics of handicapped 
populations, (b) evaluate and Improve, wherever possible, the 
accuracy of assessment for handicapped people> and (c) evaluate 
and enhance, wherever possible^ the fairness and comparability of 
tests for handicapped and nonhandlcapped examinees (see Bennett & 
Ragosta, 1984, for a more complete description of the research 
program); y 

To help satisfy th\e research program's f list objective— . 
developing an Improved base of Information and test data 
concerning the characteristics of handicapped s tudents--two 
background papers were produced* The first paper, A Research 
Context for Studying Admissions Tests and Handicapped Populations 
(Bennett & Ragosta, 1984), focused upon the characteristics of 
the four largest disability groups taking admissions tests: 
students with learning disabilities, physical handicaps, hearing 
Impairments, and visual Impairments* The second, and present, 
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paper selectively documents existing research on the performance 
of these four groups with respect to admissions tests and other 
similar measures. In addition, the psychometric characteristics 
of these measures when used with handicapped examinees are 
reported* 

Studies cited In the paper were selected on the basis of 
several criteria. One criterion w*s the similarity of the 
treasures to college and graduate admissions tes's. Cognitive 
ability tests and academic achievement measures were Included 
because their content and format is similar to admissions tests* 
A secoad criterion was the similarity of the examinees to college 
and graduate applicants. Studies of high school and college-age 
students were particularly sought/ though research on younger 
subjects was sometimes cited when little other relevant work 
could be located. A final criterion was the quality of research. 
Efforts were made to consider only those studies with samples 
large enough to offer dependable results (i.e., more than 40 
subjects). However, because samples used in research on 
handicapped pupils appear to be typically small, this criterion 
was occasionally relaxed when little dependable data could be 
f oun^ . 

In addition to sample size, efforts were made to include 
only studies for which comparable data for nonhandicapped 
subjects were available. Comparable data for nonhandicapped 
subjects are necessary to render research results interpre tnb 1 e. 
For example, the poor SAT performance of deaf students attending 
an institution has limited meaning without knowledge of the 

, 11 



performance of the Institution's general student populat;lon; deaf 
student performance may be poor but so may the performance of the 
school's students generally, suggesting the possibility that poor 
performance Is typical of those attending the Institution and not 
necessarily of the deaf population as a whole* 

The requirement for comparability was satisfied by 
indi vidua 1 studies that compared data for a sample of handicapped 
subjects from a particular Institution or geographical area with 
data for an appropriate control group from the same institution 
or area; by individual studies that compared data for a national 
sample of disabled subjects with normative data for a test or 
other data for a nationwide cross*-sec tlon of the population; or 
by comparisons of data from a series of similar studies of 
handicapped subjects with data from a series of studies of 
nondisabled people, each investigation being based on a sample 
from a particular institution or geographical area. It is 
recognized that this requirement does not guarantee comparable 
data (e.g., handicapped and nonhandicapped groups within the same 
iastltutlon may be selected using different decision rules), but 
merely enhances the likelihood of obtaining groups that are 
similar on important characteristics. Also, as with the sample 
size criterion, the comparability requirement could not always be 
applied without excluding all studies covering a given topic and 
was, therefore, occasionally relaxed. 

The research results presented in the paper are discussed 
first in terms of handicapped students as a general group and 
then in terms of each of the four disability groups most commonly 
taking admissions tests. This approach was chosen to permit a 
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general picture to emerge as well as to allow for the examination 
of differences among the specific disability groups. For both 
general and specific groups, information is presented on level of 
test performance, reliability, and validity. 

Handicapped Students in General 
Level of Performance 

Tour studies present data relevant to the test performance 
of handicapped students in general. Two of these studies concern 
the Scholastic Aptitude Test (SAT), one the American College 
Testing Program (ACT) Assessment, and one the Graduate Record 
Examinations (GRE). 

Ragosta and Nemceff (1982) reported the scores of all 
candidates taking special administrations of the SAT in 1981. 
Special administrations included braille, large-type, cassette, 
extra time, the use of a reader or recorder, or some combination 
of these accommodations (including regular-type with extra time). 
The mean scores for the approximately 6,800 pupils taking special 
administrations were 383 on the Verbal section (SAT-V) and 429 on 
the Mathematical section (SAT-M) for males, and 373 (SAT-V) and 
395 (SAT-M) tor females, respectively. During 1981, 
nonhandicapped males scored 430 (SAT-V) and 492 (SAT-M) while 
females scored 411 and 443, respectively (Ragosta f- Nemceff, 
1982). With the exception of the performance of females on the 
SAT-V, the scores of handicapped students were appreciably below 
(by close to one-half standard deviation) those of the general 
SAT-taking population. 

An investigation of the performance of handicapped students 
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who elected to take both timed and untlmed versions of the 
standard SAT was conducted by Centra (1983). Centra's sample of 
over 1,400 pupils constituted a subset of the Ptudents taking 
flpeclal administrations Included In the Ragosta & Nemceff (1982) 
study described above. Centra found the mean tlwc-' score to be 
3A4 and the mean untlmed score to be 401 for the Verbal section 
of the test. For the Mathematical section, scores were 400 for 
the timed administration and 459 for the untlmed administration. 
Scores from the timed administration were appreciably different 
(over .5 standard deviations) from those of all high school 
seniors taking the SAT during the same period (424 to 427 for 
SAT-V, 466 to 467 for SAT-M, Educational Testing Service, 1982). 
In contrast with Ragosta and Nemceff's (1982) results, scores 
from the untlmed administration were only marginally different 
(less than .25 standard deviations) from the performance of high 
school seniors. 

Centra found handicapped students to post appreciable score 
gains on the modified test over the timed administration. Gains 
averaged between 30 and 38 points after typical practice, growth, 
and measurement error effects had been taken into account. 
According to Centra, these gains were greater than those reported 
for nonhandicapped students given extra time (Hale, 1982), 
supporting the notion that time limits put disabled students at a 
disadvantage relative to the general test-taking population. 

Aside from the College Board's Scholastic Aptitude Test, 
data have been reported by the American College Testing Program 
for the ACT Assessment. Maxey and Lev?tz (1980) reported 
performance data for a sample of approximately 10,000 disabled 



students taking the standard version of the ACT Assessment on a 
national test date during the years 1975-76, 1976-77, and 1978- 
79« According to these Investigators, disabled students 
consistently achieved a composite score about a half-standard 
deviation lower than the national test norms (mean scores for 
handicapped students were 15#4, 16*1, and 16*0; scores for others 
were 18*3, 18*4, and 18*6). Data for the years 1 979 and 1980 
show similar differences (Maxey, 1982)* These performance 
differences are consistent with ACT data suggesting that 
handicapped students earn lower first year college grades (Maxey 
& Levltz, 1980). 

The final study relevant to handicapped students In general 
was reported for the standard version of the Graduate Record 
Examinations Aptitude Test (GRE) by Wild (1982). For the 1981- 
1982 testing year, the mean scores of over 2,000 disabled 
examinees were 487 for the GRE-Verbal, 510 for the GRE- 
Quantitative, and 513 for the GRE-Analytlcal» Scores for all 
examinees taking the GRE during the same period were 501, 539, 
and 539, respectively. The differences between the performance 
of handicapped and nondlsabled students on the GRE (less than .25 
standard deviations) were not as discrepant as the differences 
generally evidenced between those groups on undergraduate 
admissions tests. The greater similarity In scores on the GRE 
may be due to a more select population of handicapped students 
applying to graduate school. 
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Reliability 

Little data on the reliability of admlsslcns tests for 
handicapped students In general Is available (Bennett, 1983)* A 
preliminary Idea of the reliability of admissions tests Is given 
by the Centra (1983) study described above. Centra reported 
correlations between the SAT scores that disabled candidates 
obtained In timed and untlmed administrations. Correlations of 
.79 and .85 for the SAT-V and SAT-M, respectively, were obtained 
using a sample of over 1,800 subjects. These correlations 
necessarily underestimate the parallel-form reliability of the 
SAT for handicapped examinees as the two test administrations 
differ substantially In speededness as well as In other variables 
(Educational Testing Service, 1980). Still, only the SAT-V 
correlation was appreciably lower than the corresponding 
correlation between SAT scores In standard administrations for 
national samples of candidates: .88 to .91 for SAT-V and .85 to 
.88 for SAT-M (Donlon & Angoff, 1971). 
Validity 

As for reliability, little data on the validity of 
admissions tests for handicapped students In general appear to 
exist (Bennett, 1983). Data for the ACT Assessment from the 
1976-1977 Basic and Standard Research Service files showed 
approximately equivalent correlations between the ACT Composite 
and college grade point average (CPA) In both handicapped (.46) 
and nonhandlcapped (.A4) student populations (Maxey & Levltz, 
1980). For both groups, the combined ACT Composite score and 
self -reported high school average predicted college grades 
equally well (multiple R - .59). When actual and predicted 
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grades were comparedi a slight underpredlct Ion occured for 
handicapped students (actual grade median - 2*37, predicted « 
2*35) while a slight oveipredlctlon occured for the general 
population (actual grade mean • 2.5A> predicted - 2.58). These 
prediction errors were so small as to be of little practical 
significance • 
Summary 

Data from both the SAT aud ACT Assessment Indicate that 
handicapped students as a group perform about one-half standard 
deviation below national norms* Lower performance Is most 
evident for those handicapped students taking standard 
administrations and those taking only modified tests* Examinees 
taking both timed and untlmed tests show scores only marginally 
different from national norms on the untlmed test. Disabled 
students taking the standard version of the GRE also show only 
marginal differences In performance from national test norms 
(less than .3 standard deviations). 

Data on the reliability and validity of admissions tests for 
handicapped students as a group are largely absent. PrellmJ ary 
data for the SAT show no dependable differences In precision 
between handicapped and nondlsabled populations. Data from the 
American College Testing Program support the validity of the ACT 
Assessment as an equivalent predictor of college grade point 
average for both disabled and nonhandlcapped students. 
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Learnlnt^ Disabled Students 
Le ve 1 of Performance 

The most relevant data on the level of performance of 
learning disabled examinees on ability tests come from studies of 
the Scholastic Aptitude Test (SAT). In an Investigation of 
over 1,100 candidates with learning disabilities who chose to 
take the SAT In both a timed as well as an untlmed administration 
between 1979 and 1982, the candidates' mean SAT-V score was 338 
and their mean SAT-M score was 393 In the standard 
administration; the corresponding means In the untlmed 
administration were 394 and 452, respectively (Centra, 1983). 
The means for the standard administration were substantially 
below (by over .5 standard deviations) those for all high school 
seniors who took the SAT during the same period (424 to 427 for 
SAT-V, 466 to 467 for SAT-M, Educational Testing Service, 1982), 
whereas the means for the untlmed administration were closer to 
those for high school seniors (less than .3 standard deviations 
away ) . 

In contrast to Centra's study, which was limited to 
candidates who took the SAT in both standard and syjeclal 
administrations, markedly different results were obtained In an 
Investigation that Included all 4,044 candidates with learning 
disabilities who took the SAT in special administrations between 
1979 and 1981 (Ragosta & Nemceff, 1982). (About a quarter of the 
examinees in the Ragosta & Nemceff study were also in the Centra 
investigation.) The mean SAT-V score was 372 for males and 353 
for females, and the mean SAT-M score was 417 for males and 377 
for females. These means were well below (at least .5 standard 
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deviations) those for nonhandlcapped students who took the SAT 
during 198 1 (430 on SAT-V and 492 on SAT-M for males, and 411 and 
443, respectively, for females, Ragosta & Nemceff, 1982). The 
difference between the two Investigations Is presumably due to 
the atypical nature of the learning disabled candidate subgroup 
that elected to take the SAT In a standard as well as an untlmed 
administration* 

In a study of freshmen In a small college, the mean SAT-*V 
score was 352 and the mean SAT-M score was 382 for 55 learning 
disabled students, whereas the corresponding means for the 111 
nonhandlcapped students were 411 and 405 (Jones & Ragosta, 1982)# 
The mean difference for SAT«*V was statistically significant and 
appreciable (over .5 standard deviation units), but the mean 
difference for SAT--M was not significant (less than .3 standard 
deviation units). It is unknown what proportion of the SAT 
scores for the learning disabled students came from special 
administrations* 

Data on the performance of learning disabled students have 
also been reported for individually-administered tests of 
co;>nltlve ability such as the Wechsler Adult Intelligence Scale 
(WAIS) (Wechsler, 1955) and Wechsler Intelligence Scale for 
Children (WISC) (Wechsler, 1949). In an inves t Igat ion of 
students enrolled in a large university (Cordonl, O^Donnell, 
Ramlnlah, Kurtz, & Rosensheln, 1981), learning disabled and 
nonhandlcapped students differed appreciably (by about .5 
standard deviations) on their mean Full Scale WAIS IQ: the means 
were 108 for the former and 115 for the latter* (Verbal and 
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Performance IQ data were not reported.) The two groups also 
differed significantly and substantially in their means on 5 of 
the 11 WAIS subtests — Information, Similarities, Vocabulary, 
Digit Span, and Digit Symbol, with the means for the learning 
disabled students being consistently lower. 

In a study of 14 year old males (Ackerman, Dykmaa, & Peters, 
197 7), the mean Full Scale IQ and mean Verbal IQ on the WISC were 
significantly and appreciably lower (by at least .6 standard 
deviations) for the learning disabled than for the nonhandicapped 
pupils: 103 vs. 112 for Full Scale IQ and 101 vs. 112 for Verbal 
IQ. The mean Performance IQ was not significantly 'different for 
the two groups. 

Aside from overall performance, the subtest and subscale 
performance of learning disabled students has been investigated. 
A review of a large number of WISC Investigations found that 
learning disabled children (specifically, those with reading 
disabilities) typically had their highest mean scores on the 
Block Design, Object Assembly, and Picture Completion subtests 
and their lowest mean scores on the Digit Span and Coding 
subtests; the subtest means for nonhandicapped children followed 
no consistent pattern (Rugel, 1974). In the Ackerman et al. 
study, tVe means on 4 of the 11 WISC subtests — Arithmetic, 
Coding, Information, and Digit Span — were also significantly and 
appreciably lower for the learning disabled males. (Two of these 
subtests. Information and Digit Span, and the WAIS counterpart 
[Digit Symbol] of a third subtest. Coding, also differentiated 
learning disabled and other college students in the previously 
described study by Cordoni et al.) These same four WISC subtests 
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comprise the "ACID" cluster (an acronym composed of the first 
letter of each subtest name) reported to be clinically associated 
with learning disability in children (Sattler, 1982). 

With respect to subscale performance, an extensive review of 
WISC-R research found that both the Verbal IQ-Perf ormance IQ 
discrepancy and subtest scatter were slightly larger for children^! 
with learning disabilities than for nonhandicapped children 
(Kaufman, 1981). However, differences in the amount of scatter 
and subscale discrepancy were too small to be of diagnostic 
value . 

Pertinent data on the level of performance of learning 
disabled examinees on achievement tests are sparse. In the 
Ackerman et al. (1977) study of 14 year old males already cited, 
the means for all three scales on the Wide Range Achievement Test 
(WRAT) (Jastak & Jastak, 1965) — Reading, Spelling, and Arithmetic 
— were appreciably lower for the learning disabled than for the 
other students. 
Reliability 

Useful data on the reliability of cognitive tests for 
learning disabled examinees are limited. The best data come from 
the Centra (1983) study described earlier. This investigation 
reported the correlations between the SAT scores that 1,400 
learning disabled candidates obtained in standard and special 
administrations: .76 for SAT-V and .84 for SAT-M. Because the 
two test administrations materially differ in time limits and 
other variables (Educational Testing Service, 1980), these 
correlations necessarily underestimate parallel-form reliability. 
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Nonetheleps, only the SAT-V correlation was appre<i^ably lower 
than the corresponding correlation between SAT scores in standard 
administrations fot national samples of candidates: .88 to .91 
for SAT-V and .85 to .88 for SAT-M (Donlon & Angoff, 1971). 

McCullough and Zaremba (iy79> reported findings about the 
internal-consistency reliability of the Woodcock Reading Mastery 
Test (Woodcock, 1973) and the KeyMath Diagnostic Arithmetic Test 
(Connolly, Nachtman, & Pritchett, 1971) for a sample of 12 to 17 
year old males. The Coefficient Alpha reliabilities of the total 
scores on these tests were similar for the learning disabled and 
the other pupils in the sample: .88 for the learning disabled and 
.92 for the others on the Woodcock Reading Mastery Test, and .94 
and .97, respectively, on the KeyMath Diagnostic Arithmetic Test. 
Validity 

The most pertinent findings about the validity of cognitive 
tests for learning disabled examinees stem from the Jones and 
Ragosta (1982) study of SAT scores for freshmen attending a small 
Massachusetts college. SAT-V and SAT-M had similar correlations 
with first-semester grade-point average for 55 learning disabled 
and 111 other students: for the former group, .32 for SAT-V and 
.29 for SAT-M; for the latter group, .34 and .28. However, a 
regression equation based on the SAT data for the nonhandicapped 
students systematically overpredicted the grades for the learning 
disabled students (i.e., i>romised higher grades than actually 
achieved). The amount of o verprediction, though statistically 
significant, was of little practical import (.27 of a grade 
point ) . 

Factor analyses of the WISC-R (Wechsler, 1974) for children 

13 22 

ERIC 



of e lementary*8chool age also provide some Insight Into the 
validity of cognitive tests for learning disabled pupils* 
Several investigations of learning disabled examinees (Blaha & 
Vance, 1979; Naglleri, 1981; Petersen & Hart, 1979; Schooler, 
Beebe, & Koepke, 1978; Wallbrown & Blaha, 1979) generally found 
the same factor structure that had been Indentified In the WISC-R 
standardization sample (Gutkln &' Reynolds, \981 ; Kaufman, 1975; 
Ramlniah, O'Donnell, & Rlblch, 1976; Reynolds & Gutkln, 1980; 
Sllversteln, 1977; Wallbrown, Blaha, Walibrown, & Engin, 1975). 
This structure consisted of a large verbal comprehension factor 
(mainly defined by the Information, Similarities, Vocablilary, and 
Comprehension subtests); a large perceptual organization factor 
(primarily defined by the Picture Completion, Picture 
Arrangement, Block Design, Object Assembly, and Mazes subtests); 
and oftentimes a smaller, less stable, freedom from distracti* 
billty factor (usually defined by the Arithmetic, Digit Span, and 
Coding subtests). / 



Factor analyses of the Woodcock Reading Mastery Test 
(Woodcock, 1973) and the KeyMath Diagnostic Arithmetic Test 
(Connolly et al., 1971) were conducted by McCullough and Zaremba 
(1979). In their study of 12 to 17 year old males described 
above, the factor structure of the Woodcock Reading Mastery Test 
was the same for the learning disabled and the other students: 
one factor (defined by all the subtests) emerged. However, the 
factor structure of the KeyMath Diagnostic Arithmetic Test 
differed for the two groups. Two factors were identified for 
learning disabled pupils, an operations factor (defined by the 
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Addition, Subtraction, Multiplication, and Division subtests) and 
a content and application factor (defined by most of the other 
subtests: Numeration, Fractions, Geometry and Symbols, Word 
Problems, Missing Elements, Money, Measurement, and Time). Only 
one factor (defined by all the subtests) emerged fot the other 
subjects* 
Summary 

With the exception of those who take both standard and 
untimed administrations, learning disabled examinees generally 
perform well below (by at least .5 standard deviations) 
nonhandicapped examinees on the Scholastic Aptitude Test. 
Similar performance differences (at least .5 standard deviations) 
are found for this group on individually-administered tests, such 
as the Wechsler Intelligence Scales (Full Scale and Verbal 
Scale), and the Wide Range Achievement Test. Studies of intra- 
test performance on the Wechsler Scales suggest that learning 
disabled students evidence only slightly larger Verbal- 
Performance discrepancies and subtest scatter, but consistently 
poorer performance on the "ACID" cluster, than nondisabled 
s tudent s . 

Little data exist on the reliability of cognitive tests for 
learning disabled students. Prelimary data on admissions tests 
do not show any consistent differences in measurement precision 
between the learning disabled and general student populations. 
Achievement tes-ts, such as the Woodcock Reading Mastery Test and 
the KeyMath Diagnostic Arithmetic Test, have also shown 
equivalent precision across populations. 

The validity of cognitive tests for learning disabled 
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students has been most thoroughly Investigated with respect to 
the WISC-R* Studies have repeatedly found the factor structure 
of this test when used with learning disabled examinees to mirror 
that found in the standardization sample. Results from a single 
study of the SAT suggest similar levels of predictive power for 
the test across populations* Finally, a single Investigation 
Into the structure of achievement tests produced equivalent 
factor structures for the Woodcock but not for the KeyMath. 

Physically Handicapped Students 
Le ve 1 of Performance 

The best data on the level of performance of physically 
handicapped examinees on admissions tests come from the Centra 
(1983) and the Ragosta & Nemceff (1982) studies of the SAT. For 
96 candidates with physical disabilities taking the SAT in both 
timed and untlmed administrations, the SAT-V mean was 398 ;ind the 
SAT-M mean was 428 in the standard administration; the correspond* 
ding means were 446 and 486 in the untlmed administration 
(Centra, 1983)* Means from the timed administration were 
somewhat below (i.e., within .35 standard deviation units) while 
those from the untlmed test were above (i.e., up to .19 standard 
deviations) those for all high school seniors (424 to 427 for 
SAT-V, 466 to 467 tor SAT-M) (Educational Testing Service, 1982). 

Similarly, for all 568 candidates with physical disabilities 
taking the SAT in special administrations (approximately a fifth 
of these examinees were in the Centra Investigation), the SAT-V 
means were 424 for males and 410 for females, and the SAT-M means 
were 460 for males and 410 for females (Ragosta & Nemceff, 1982). 
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These sets of means are close (within .3 standard deviations) to 
the means for nonhandicapped students (430 on SAT-V and 492 on 
SAT-M for males, and 411 and 443, respectively, for females, 
Ragosta & Nemceff, 1982). 
Reliability 

Useful data about the reliability of ability tests for 
physically handicapped examinees are extremely limited. The most 
relevant data come from the Centra (1983) study, which reported 
the correlations between the SAT scores for 129 physically 
handicapped examinees in standard and untimed administrations. 
The correlations were .83 for SAT-V and .86 for SAT-M. These 
lower-bound estimates of parallel-form reliability were similar 
in level to the retest reliability of the SAT in standard 
administrations for national samples of examinees (.88 to .91 for 
SAT-V and .85 to .88 for SAT-M, Donlon & Angoff, 1971). 
Validity 

No relevant data could be located on the validity of 
cognitive tests far physically handicapped examinees. 
Summary 

Data on both the test performance of physically handicapped 
people and the psychometric characteristics of tests used with 
them are extremely limited. The little data that do exist 
suggest that these individuals perform relatively close to 
national noi^ms on standard and on untimed administrations of the 
Scholastic Aptitude Test (i.e., within .35 standard deviation 
units). Preliminary reliability data suggest that the SAT 
measures with equal precision for physically handicapped and 
nondisabled groups. No validity data could be located. 
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Hearing Impaired Students 

Level of Performance 

Ragosta and Nemceff (1982) studied over 250 hearing-Impaired 
candidates taking the SAT In nonstandard administrations between 
1979 and 1981« The mean SAT^V scores were 301 for males and 299 
for females, and the mean SAT-M scores were A03 for males and 381 
for females* The means for hearing-Impaired students were well 
below the means (between .5 and 1«2 standard deviations) for 
nondlsabled students (430 on SAT-V and 492 on SAT-M for males, 
and 411 and 443, respectively, for females, Ragosta & Nemceff, 
198: . 

Jones and Ragosta (1982) studied 108 hearing-Impaired 
students attending a stat^ university In California* These 
Investigators found means for the hearing Impaired sample to be 
355 tor SAT-V and 413 for SAT-M* Corresponding means for the 259 
hearing 8tud.ents attending the same school were 455 and 480, 
respectively* Differences In means between the two groups were 
statistically significant and appreciable (between *6 and 1*1 
standard deviation units)* In addition, Jones and Ragosta found 
mean SAT-V scores to increase with delay in onset of deafness 
(319 for those with onset at birth, 383with onset 'in preschool, 
and 466 with onset at age 5. and over)* 

Additional information on the cognitive test performance of 
deaf and hearing-impaired students comes from studies using 
nonverbal, individually-administered tests such as the 
Performance Scales of the WISC (Wechsler, 1949), WISC-A 
(Wechsler, 1974), and WAIS vWechsler, 1955). Schlldroth (1976) 
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reported the nonverbal IQ scores of more than 21,000 students 
enrolled In programs for the hearing impaired in the 1970 to 1971 
school year. The mean nonverbal IQ for all hearing impaired 
students was 95.6. For the subsample of hearing Impaired 
children with no additional handicapping condition, the mean 
nonverbal IQ was 100.1. Students with at least one additional 
handicapping condition registered a mean score of 86.5. 
Schlldroth's data are supported by Meadow (1980) who, in 
reviewing several studies, concluded that deaf children usually 
score within the average range on the Wechsler Performance 
Scales . 

Scores on both the WAIS Performance and Verbal Scales were 
reported by Ross (1970) for deaf students age 16-21. Testing was 
accomplished using simultaneous manual and verbal communication 
with responses accepted in either mode. Ross found that the mean 
Performance IQ was 106. The mean verbal IQ was 72, a score well 
below (by almost two standard deviations) the performance of the 
general population. 

Studies of the subtest performance of deaf children suggest 
that these pupils get their best scores on the Object Assembly 
and Block Design subtests of the WISC-R Performance Scale 
(Vonderhaar & Chambers, 1975; Hess, 1960). They seem to perform 
significantly more poorly than their hearing peers on the Coding 
and Picture Arrangement subtests (Anderson & Slsco, 1977). 

Data on the achievement test performance of deaf students 
come from the 1974 national standardization of the Special 
Edition of the 1973 Stanford Achievement Test for Hearing 
Impaired Students (Trybus & Karchmer, 1977). Trybus and Karchmer 
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reported the median reading comprehenalon score at the test's 
highest age level (20+ years) to be at grade level A»5, with only 
10 percent of hearings-Impaired 18 year^olds scoring at an 8th«» 
grade level* In mathematics computation, the median score at age 
20+ was just below 8th grade level, with only the top ten percent 
of hearing-Impaired students scoring at the level of hearing 
students* According to Trybus and Karchmer, these results are 
consistent with data obtained In 1969 and 1971 from samples of 
12,000 and 19,000 children, respectively* 
Reliability 

Reliability data on the SAT for deaf candidates Is reported 
by Centra (1983)* For 34 candidates with hearing Impairments who 
took the SAT In both standard and untlmed administrations, 
estimates of reliability were •89 for the SAT-V and •79 for the 
SAT-M^ Because of differences In test administration, these 
correlations underesclmar . parallel-form reliability^ Compared 
to a national sample of nonhandlcapped candidates (•SS to ^91 for 
SAT-V; .85 to ^88 for SAT-M) (Dor.lon & Angoff, 1971), the 
reliability of SAT-V for deaf examinees Is equivalent to that for 
the general population while the reliability of the SAT-M Is only 
slightly lower^ Given the small number of deaf students used to 
generate these reliability estimates, the above comparisons 
should be viewed with caution^ 

In addition to the Scholastic Aptitude Test, reliability 
data are reported for the Stanford Achievement Test for Hearing 
Impaired Students (SAT-HI) by Jensema (1978)^ Internal 
consistency coefficients for the SAT-HI subtests ranged from a 
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low of .64 to a high of .96 with most estimates in the .808. 
These estimates are roughly comparable to reliabilities typically 
found for the Stanford Achievement Test when used with 
nondlsabled students. Subtest reliabilities for Intermediate 
Form E range from .42 to .84, with most values falling in the 
.60s and .708 for fifth grade students (Gardner, Rudman, Karlsen, 
& Merwln, 1982). 
Validity 

The most pertinent findings about test validity for deaf 
pupils come from Jones and Ragosta's (1982) study of the 
Scholastic Aptitude Test (SAT) in a California state university. 
These researchers found correlations with freshman grade-point 
average of .14 and .41 for SAT-V and SAT-M, respectively, for 60 
deaf students, and .38 and .32 for SAT-V and SAT-M for 140 
hearing examinees. The SAT validity coefficients for the two 
groups were not significantly different, though the difference 
between the SAT-V coefficients approached statistical 
significance (p-.059) and warrants further investigation. When 
combined with high school grade point average, the SAT predicted 
freshman performance in the hearing impaired and hearing 
populations equivalent ly. 

Research on the validity of the Gallaudet Entrance Exam is 
reported by Greenberg and Greenberg (1971). The exam, used at 
Gallaudet College, consisted of 13 tests including some 
commercial measures and some developed especially for deaf 
students. The tests were selected or designed to measure subject 
matter knowledge (e.g., algebra, vocabulary), special skills 
(e.g., writing, paragraph arrangement), and academic potential 
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(e«g«» concept formation)* Of the 13 te8t8» the best predictors 
of college performance In the Gallaudet class of 1965 were verbal 
measures: vocabulary and writing* Each measure correlated *4A 
with college grade point average* Similar correlations were 
observed for the following year's class (Greenberg & Greenbergi 
1971)* 
Summary 

The available research on the performance of hearing 
Impaired students suggests that these pupils perform well below 
the general population on the Scholastic Aptitude Test, on the 
Verbal portion of the Wechsler Adult Intelligence Scale, and on 
reading comprehension and mathematics computation sections of the 
Stanford Achievement Test for Hearing Impaired Students* The 
discrepancy between the performance of the two populations Is 
consistently more pronounced on the verbal portions of these 
measures than on the math or performance sections, a finding 
consonant with the well-documented natural language deficiencies 
of deaf Individuals (Meadow, 1980)* 

Data on the reliability of admissions tests for deaf 
students are largely lacking* Data from one study show no 
dependable differences in measurement precision between hearing 
Impaired and general student grcups* Data for the Stanford 
Achievement Test also show roughly comparable reliability values 
across populations* 

As is the case for reliability, validity data are largely 
absent. A preliminary study conducted at a single institution 
found the SAT combined with high school grades to predict college 
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performance equlvalently in hearing-impaired and hearing 
populations. In addition, studies conducted at Gallaudet College 
showed the best predictors of success at that institution to be 
' verbal measures. 

Visually Impaired Students 
Level of Performance 

Data on the admissions test performance of visually-impaired 
people come from studies of the Scholastic Aptitude Test and ACT 
Assessment. Ragosta and Nemceff (1982) rv%ported scores for 
almost 1,500 visually impaired candidates taking nonstandard SAT 
administrations between 1979 and 1981. The mean SAT-V scores 
were 420 for males and 411 for females and the mean SAT-M scores 
were 459 and 421 for males and females, respectively. These 
scores were only slightly different (within .3 standard 
deviations) from those for nondisabled students taking the 
test during 1981 (430 on SAT-V and 492 on SAT-M for males, 
and 411 and 443, respectively, for females, Ragosta & Nemceff, 
1982). 

In his Investigation of handicapped candidates taking the 
SAT in both timed and untimed administrations. Centra (1983) 
found the mean SAT-V and SAT-M scores for some 177 visually 
impaired candidates to be 371 and 432, respectively, in standard 
administrations. Corresponding means for the nonstandard, 
untimed administrations were 433 for the SAT-V and 494 for the 
SAT-M. Only the score from the SAT-V standard administration was 
appreciably below (by .5 standard deviation units) that achieved 
by high school seniors during the same period (424 to 427 for 
SAT-V, 466 to 467 for SAT-M, Educational Testing Service, 1982). 
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The American College Testing Program (ACT) reported on the 
performance of 412 visually Impaired candidates taking special 
test administrations in the 1979-'1980 academic year (Maxey & 
Levltz, 1980). On English, Mathematics, Social Studies, and 
Natural Sciences tests, visually handicapped students achieved 
mean scores of 16.1, 16.6, 17.7, and 19.7, respectively. 
Nondlsabled students scored comparably (i.e., within .1 to .4 
standard deviation units), achieving means of 17.9, 17.4, 17.2, 
and 21.0, respectively, on the same measures. The mean composite 
score for the visually impaired was 17.7, comparing favorably 
with a composite of 18.5 for the nonhandlcapped (i.e., a 
difference of less than .2 standard deviations). 

In addition to performance on admissions tests, the 
performance of visually Impaired people on the Verbal Scales of 
the Wise (Wechsler, 1949) and WAIS (Wechsler, 1955) has been 
investigated. In general, the scores of visually impaired people 
appear equivalent to'those of the sighted population (Vander 
Kolk, 1977; Jordan & Felty, 1968; Smlts & Mommers, 1976j Tillman, 
1967). Similar results have been found using the Hayes-Blnet and 
Interim Hayes-Blnet (Hayes, 1941). Characteristically, these 
studies have also found higher than average scores on Digit Span 
for visually impaired examinees and lower than average 
performance on the Comprehension subtest. Finally, no 
differences between the performance of blind and partially 
sighted examinees appear evident (Vander Kolk, 1977). 

While the performance of visually impaired people appears 
equivalent to the general population on academic ability tests. 




data on achievement test performance seems more discrepant* For 
example, Ozlas (1975) reported that three separate Investigations 
found blind children well below the mathematics achievement level 
of their sighted peers. Hallahan and Kauffman (1978), In a 
review of the characteristics of visually Impaired children, 
concluded that both partially sighted and blind children achieve 
more poorly than their sighted peers when matched on mental age. 
Finally, the Special Assessment of Visually Impaired Students of 
the Florida Statewide Assessment Program (Florida Department of 
Education 1981), reported that the achievement of visually 
disabled students tested In 1979 and 1980 was well below the 
achievement of regular students. 

The cause of the reported discrepancies between the ability 
and achievement test performance of visually Impaired students Is 
not Immediately clear. Such discrepancies may be due to 
differences In the segments of the visually Impaired population 
that take ability and achievement tests (e.g., a more select 
segment may take ability tests), the use of achievement tests 
with content or administrative procedures that are less 
appropriate for assessing the skills of visually impaired 
students (e.g., group versus individually administered measures), 
or a real difference between general cognitive ability and 
accumulated academic knowledge. 

Limited evidence that modifications in achievement test 
content and procedure affect the performance of visually impaired 
students was provided by Trlsmen ( 1967). In a study desi.gned to 
produce an equivalent w.rm of the Sequential Tests of Educational 
Progress for blind students, this investigator found the 
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performance of over 200 blind students to be roughly comparable 
to national norms when time limits had been relaxed and 
unsuitable items removed* 
Reliability 

Reliability data on the Scholastic Aptitude Test for 
visually Impaired candidates Is reported by Centra (i983)« For 
218 candidates who took the SAT under both timed and untlmed 
conditions, estimates of re 1 labl 1 1 ty were ,83 for the SAT-V and 
•84 for the SAT-M. Because of the differences In test 
administration,, these correlations underestimate parallel-form 
reliability. They are, nevertheless, comparable to the 
reliabilities reported for a natl^onal sample of sighted 
candidates: .88 to .91 for SAT-V and .85 to ,88 for SAT-M 
(Donlon & Angoff, 1971). 

Tillman (1973) reported the reliability results from several 
studies of the WISC Verbal Scale when used with blind students. 
Internal consistency reliabilities on the WISC subtests ranged 
from .71 to •83* For the Verbal Scale, the split-half 
reliability was estimated to be «89 and the test-retest 
reliability was •91. These results are generally comparable to 
those reported for the WISC standardization sample. Split-half 
reliabilities for that sample range from .50 to .91 for the 
Verbal Scale subtests and .88 to .96 for the Verbal Scale IQ 
(Wechsler, 1949). 

The test-retest reliability of the Interim Hayes-Blnet used 
with visually handicapped students was reported by Hsyes (1950) 
to be .90. Preliminary estimates of the split-half reliabilities 
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for the new Perklns-Blnet were reported by Coveny (1972) to be 
.96 for braille readers on Form N and .94 for print readers on 
Form U. These estimates are generally comparable to those found 
In uses of the S tanf ord-Blnet with nondlsabled students, for 
which alternate-forms reliabilities range from .83 to .98 (Terman 

& Merrill* 1973). 

Reliability coefficients for several achievement tests 
modified for use with blind students and standardized on students 
at Perkins Ii^stitution are reported by Bauman and Hayes (1951). 
Retests with forms D-H of the Stanford Achievement Tests yielded 
coefficients from .87 .to .95. Form R of the Metropolitan 
Achievement Test produced a tetrachorlc correlation of .98 with 
Form S taken two years later at Perkins. These reliabilities are 
roughly comparable to those typically found for the general 
population. For the nondlsabled population, retest reliabilities 
for the Stanford Achievment Tests range from .87 to .93 on 
Intermediate 1 Forms E and F (Gardner, Rudman, Karlsen, & Merwln, 
1982). For the Metropolitan, representative Internal consistency 
estimates range from .79 to .93 for the Advanced 1 when used with 
eighth grade students (Farr, Prescott, Balow, & Hogan, 1978). 
Validity 

Very little data on the validity of cognitive tests for 
blind or partially sighted students exist. Lewis (1957) 
investigated the validity of the Interim Hayes-Blnet using 
academic achievement as one validity criterion. She found 
correlations of .45, .46, and .53 between the Interim Hayes-Blnet 
and grade averages in elementary, junior high, and high school, 
respectively. These results suggest a moderately strong 
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relationship between the test and school grades. 

Coveny (1973) undertook a predictive validity study of the 
Perkins-Binet (PB) and the WISC Verbal Scale as predictors of the 
academic achievement of visually handicapped children. A 
multiple regression analysis showed both the PB and WISC to 
predict teacher grades and Stanford Achievement Test subscores. 
Percentages of variance (r^) accounted for by PB subtests ranged 
from .08 to .23 for teacher grades, and .15 to .41 for Stanford 
subscores. The equivalent figures for the WISC ranged from .16 
to .34 and .20 to .40, respectively. 
Summary 

From the available data, the performance of visually 
Impaired students on special administrations of admissions tests 
appears to be close to that of the general candidate population. 
Only on the standard SAT-V do the scores of visually handicapped 
students seem appreciably lower than those of sighted students. 
On individually administered tests of cognitive ability, such as 
the WISC Verbal S ile, visually-impaired examinees achieve scores 
equivalent to those of the national norm group. Performance on 
achievement tests appears to be below national norms. 

The existing research on the reliability of cognitive tests 
for visually impaired pupils suggests values comparable to that 
for the sighted population for a variety of tests including the 
Scholastic Aptltuda Test, WISC Verbal Scale, Interim Hayes Blnet, 
Stanford Achievement Tests, and Metropolitan Achievement Tests. 
Though a few studies suggest significant relationships between 
individually-administered cognitive tests and grades, data are 
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largely lacking on the validity of cognitive tests for visually 
impaired examinees* 

Conclualon 

This paper has documenced the available data on the 
performance of handicapped people on admissions and similar 
tests* Additionally, information on the psychometric 
characteristics of these measures was reported* Though much more 
information needs to be collected, some tentative observations 
can be m|ide from the existing data* 

Table 1 lists performance differences between handicapped 
examinees and the general population in standard deviation (SD) 
units* As the table suggests, the admissions test performance of 
physics 1 ly handicapped students and of visually impaired 
examinees is most similar to the nondisabled population* As 
noted earlier, the available data suggest visually impaired 
students only perform appreciably lower than national norms on 
the standard SAT-V* Their performance on individually 
administered verbal ability tests is generally equivalent to 
national norms, though they appear to do more poorly on 
achievement measures* 
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The admissions test performance of learning disabled 
individuals appears next among the specific disability groups in 
distance from the general population* With the exception of 
those who take both untimed and standard administrations, the 
admissions test performance of these examinees is appreciably 
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lower than that of their nondlsabled peers (generally by at least 
•5 standard deviations). Similar performance differences are 
reported for individually administered tests such as the Wechsler 
^§4es (Full Scale and Verbal IQ) and the Wide Range Achievment 
Tes t . 

Hearing impaired students perform the least well of all 
grpups on admissions tests* They also show performance well 
below that of national norms on the Verbal portion of the WAIS 
and on the reading comprehension and mathematics computation 
sections of the Stanford Achievement Test for Hearing Impaired 
Students* Their performance is consistently more different from 
the nondisab^lsed population on verbal than on math or performance 
measures* The poor performance of this group on admissions tests 
and the effects such performance may have on college and graduate 
admissions, make further investigation of the fairness of 
admissions tests for this group (as well as for those with 
learning disabilities) critical. 

The limited data on the reliability of admissions tests show 
no trustworthy uifferences between specific disability groups and 
the general test-taking population (see Table 2). The fact that 
these estimates were derived by correlating scores from 
dissimilar administrations (i«e«i timed and untimed), however, 
suggests the need for further investigation before definitive 
conclusions can be offered* Finally, data from studies on 
several school ability and achievement tests suggest comparable 
measurement precision between general and visually impaired, 
deaf, and learning disabled examinees. 
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The results of the small number of admissions test validity 
Investigations are presented in Table 3. Data on the ACT 
Assessment support the validity of this measure as an equivalent 
predictor of college performance among disabled students in 
general and nonhandicapped examinees. Two studies of the SAT at 
individual Institution* show no significant differences in 
validity coefficients for deaf and learning disabled students 
compared to their nonhandicapped peers. Finally, the factor 
structure of the WISC-R for learning disabled examinees has been 
consistently demonstrated to mirror that for the general 
population. 



Insert Table 3 about here 



The last observation to be made from this review relates to 
the paucity of relevant, dependable data. The investigations 
reviewed in the paper were located through extensive searches of 
the ERIC and Psychological Abstracts data bases, journals such as 
the Journal of^ Learning Disabilities and American Annals of the 
Deaf , and standard texts such as Settler's (1982) Assessment o£ 
Children's Intelligence and Special Abilities . Because of the 
large amount of special education testing done in the nation's 
schools, reports were also obtained from several large school 
districts. Despite this search, little dependable data on the 
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cognitive test performance of handicapped students and the 
psychometric characteristics of tesZ^ used with these Individuals 
were found. 

The major Implication of this lack of data Is that 
defensible conclusions cannot yet be drawn about the fairness of 
admissions and similar tests for assessing the abilities of 
handicapped people* The lack of data leave unanswered many 
Important questions that must be resolved before definitive 
statements can be made. For onei the lack of data leave 
undefined the exact nature of the subgroup of handicapped persons 
taking admissions tests. This subgrqup may be more or less 
select than the subgroup of nondlsabled students who take 
admissions tests. Differences In the nature of these two 
subgroups may result In estimates of test performance» 
reliability, and validity that differ, but that are not 
representative of the values that characterize the populatloxts 
from which these subgroups come. 

A second unanswered question relates to the cause of those 
differences In test performance that are determined to 
distinguish handicapped and nondlsabled populations. Such 
differences may be the result of test bias, as when disparities 
In tested performance across groups are not reflected In 
similarly d.^^ferent levels of future academic accomplishment. 
Alternatively, tested differences may reflect real disparities In 
the abilities of handicapped and nondlsabled people. Such 
differences would be expected to manifest themselves In poorer 
academic performance and, hence, would not Imply the existence of 
bias in admissions tests. 

32 41 



To resolve these questions, further studies of test 
performance and of the reliability, predictive validity, and 
factor structure of the scholastic measures used with disabled 
students are badly needed. The research program being undertaken 
by the College Board, Educational Testing Service, and the 
Graduate Record Examinations Board Includes many of these 
Investigations. The collection of such Information should 
provide a better basis for evaluating the fairness of tests for 
disabled Individuals and for Improving the admissions testing 
process . 
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Footnotes 



The authors are Hated in alphabetical order. Majorle Ragosta 
drafted the sections on the hearing Impaired and the visually 
Impaired. Those on the learning disabled and physically 
handicapped were drafted by Lawrence J. Strieker. Randy 
Elliot Bennett developed the Introduction, the section on 
handicapped students In general, the summaries, and the 
conclusion, and completed the manuscript's final draft. 



Table 1 

Differences Between The Admissions Test Performance 
of Handicapped and Nondisabled Examinees^ 



Group 



Median and Range of 
Differences in SD Units 



Physically Handicapped 
Visually Impaired 
Learning Disabled 
Hearing Impaired 



-.15 (-.33 to +.19) 

-.16 (-.50 to +.24) 

-.57 (-.80 to -.12) 

-.88 (-1.17 to -.57) 



* Data are from studies of the SAT, GRE, and ACT Assessment 
described in this paper. Performance differences are expressed 
In standard deviation units of the nondisabled group and include 
results of both standard and nonstandard administrations. 
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Table 2 



Reliability Estimates Associated with the 

Admissions Te'st Performance 
of Handicapped and Nondj^sabled Examinees^ 



Reliability Estimate 
Group SAT-V SAT-M 



Nondlsabled .88-, 91 .85-. 88 

Visually Imp^^ed .83 c .84 

Physically Handlcap.ped .83 .86 

Learning Disabled .76 .84 

Hearing Impaired .89 .79 

Handicapped Students In General .79 .85 



^ Data on handicapped student groups are from Centra (1983). 
Data on nondlsabled students are from Donlon and Angoff (1971). 
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Table 3 

Validity Estimates Associated with the 
Admissions Test Performance 
of Handicapped and Nondisabled Examinees 



Validity Estimate 



Group 



SAT-V 



SAT-M 



ACT 



Jones & Ragosta 

Nondisabled \ 
Learning Disabled 



34 
32 



28 
29 



Nondisabled 
Hearing Impaired 



38 
14 



32 
41 



Maxey & Levltz (1980) 

Nond Isabled 
Handicapped Students 
In General 



/44 
.46 
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